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Abstract 

In this paper we give an explicit and algorithmic description of Graver basis for 
the toric ideal associated with a simple undirected graph and apply the basis for 
testing the beta model of random graphs by Markov chain Monte Carlo method. 
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in 

^ . Random graphs and their applications to the statistical modeling of complex networks 
Q i have been attracting much interest in many fields, including statistical mechanics, ecology. 



biology and sociology (e.g. Newman [10|, Goldenberg et al [6|). Statistical models for 



random graphs have been studied since Solomonoff and Rapoport [19| and Erdos and 
Renyi [sj introduced the Bernoulli random graph model. The beta model generalizes 
^ . the Bernoulli model to a discrete exponential family with vertex degrees as sufficient 
! statistics. The beta model was discussed by Holland and Leinhardt jsj in the directed 
case and by Park and Newman 15 1, Blitzstein and Diaconis jl| and Chatterjee et al 
in the undirected case. The Rasch model 17 , which is a standard model in the item 



response theory, is also interpreted as a beta model for undirected complete bipartite 
graphs. In this article we discuss the random sampling of graphs from the conditional 
distribution in the beta model when the vertex degrees are fixed. 

In the context of social network the vertices of the graph represent individuals and 
their edges represent relationships between individuals. In the undirected case the graphs 
are sometimes restricted to be simple, i.e., no loops or multiple edges exist. The sample 
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size for such cases is at most the number of edges of the graph and is often smalL The 
goodness of fit of the model is usually assessed by large sample approximation of the 
distribution of a test statistic. When the sample size is not large enough, however, it is 
desirable to use a conditional test based on the exact distribution of a test statistic. For 
the general background on conditional tests and Markov bases, see Drton et al 

Random sampling of graphs with a given vertex degree sequence enables us to numer- 
ically evaluate the exact distribution of a test statistics for the beta model. Blitzstein 
and Diaconis [ij developed a sequential importance sampling algorithm for simple graphs 
which generates graphs through operations on vertex degree sequence. In this article we 
construct a Markov chain Monte Carlo algorithm for sampling graphs by using the Graver 
basis for the toric ideal arising from the underlying graph of the beta model. 

A Markov basis jsj is often used for sampling from discrete exponential families. Al- 
gebraically a Markov basis for the underlying graph of the beta model is defined as a set 
of generators of the toric ideal arising from the underlying graph of the beta model. A set 
of graphs with a given vertex degree sequence is called a fiber for the underlying graph 
of the beta model. A Markov basis for the underlying graph of the beta model is also 
considered as a set of Markov transition operators connecting all elements of every fiber. 
Petrovic et al 16| discussed some properties of the toric ideal arising from the model of 
[sj and provided Markov bases of the model for small directed graphs. Properties of toric 
ideals arising from a graph have been studied in a series of papers by Ohsugi and Hibi 



Hi, (12|, [13|). 



The Graver basis is the set of primitive binomials of the toric ideal. Applications of 



the Graver basis to integer programming are discussed in Onn [IJ]. Since the Graver 
basis is a superset of any minimal Markov basis, the Graver basis is also a Markov basis 
and therefore connects every fiber. When the graph is restricted to be simple, however, 
a Markov basis does not necessarily connect all elements of every fiber. A recent result 
by Hara and Takemura 0] implies that the set of square-free elements of the Graver basis 
connects all elements of every fiber of simple graphs with a given vertex degree sequence. 
Thus if we have the Graver basis, we can sample graphs from any fiber, with or without 
the restriction that graphs are simple, in such a way that every graph in the fiber is 
generated with positive probability. 

In the sequential importance sampling algorithm of [H the underlying graph for the 
model was assumed to be complete, i.e., all the edges have positive probability. In our 
approach we can allow that some edges are absent from the beginning (structural zero 
edges in the terminology of contingency table analysis), such as the bipartite graph for 
the case of the Rasch model. In fact the Graver basis for an arbitrary graph is obtained 
by restriction of the Graver basis for the complete graph to the existing edges of G 
(cf. Proposition 4.13 of Sturmfels (2o[). Moreover our algorithm can be applied not 
only for sampling simple graphs but also for sampling general undirected graphs without 
substantial adjustment. These are the advantages of the Graver basis. 

The Graver basis for small graphs can be computed by a computer algebra system 
such as 4ti2 (4ti2 team 2l|). For even moderate-sized graphs, however, it is difficult to 
compute the Graver basis via 4ti2 in a practical amount of time. In this article we first 
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provide a complete description of the Graver basis for an undirected graph. In general 
the number of elements of the Graver basis is too large. So we construct an adaptive 
algorithm for sampling elements from the Graver basis, which is enough for constructing 
a connected Markov chain over any fiber. The recent paper of Reyes et al 18| discusses the 
Graver basis for an undirected graph and gives a characterization of the Graver basis. We 
give a new description of the Graver basis, which is more suitable for sampling elements 
from the Graver basis. 

The organization of this paper is as follows. In Section [2] we give a brief review on 
some statistical models for random graphs and clarify the connection between the models 
and toric ideals arising from graphs. In Section [3] we provide an explicit description of 
the Graver basis for the toric ideal associated with an undirected graph. Section H] gives 
an algorithm for random sampling of square-free elements of the Graver basis. In Section 
|5]we apply the proposed algorithm to some data sets and confirm that it works well in 
practice. We conclude the paper with some remarks in Section [61 



2 The beta model of random graphs 

In this section we give a brief review of the beta model for undirected graphs according 
to Chatterjee et al 

Let G be an undirected graph with n vertices V{G) = {1,2, ... ,n}. Here we assume 
that G has no loop. Let E = E{G) be the set of edges. For each edge G E, let a 

non-negative integer Xij be the weight for and denote x = {xij \ {i,j} G E}. x is 

considered as an \E\ dimensional integer vector. We assume that an observed graph H is 
generated by independent binomial distribution B{nij,pij) for each edge {i,j} G E, i.e.. 



X. 



B{nij,pij) with 



Then the probability of H is described as 



(1) 

The model ([1]) is called the beta model [2|. Note that if Xij = then the observed graph 
H does not have an edge even if G E{G) for the underlying graph G. 

This model was considered by many authors (e.g. Park and Newman 15|, Blitzstein 
and Diaconis [l[ and Chatterjee et al Q). The pi model for random directed graphs 
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by Holland and Leinhardt jsj can be interpreted as a generalization of the beta model. 
When G is a complete bipartite graph, the beta model coincides with the Rasch model 



17[. The many-facet Rasch model by Linacre |9[, which is a multivariate version of the 
Rasch model, can be interpreted as a generalization of the beta model such that G is a 
complete fc-partite graph. 

Let di, . . . ,dnhe a. degree sequence, i.e., di := J2j-{ij}&E^ij ^s^ch vertex i. Denote 
d := {di, . . . ,dn). The sufficient statistic for ([1]) is d. Let A : |V| x \E\ denote the 
incidence matrix between vertices and edges of G. Then it is easily seen that x and d are 
related as 

Ax = d. 

A set of graphs (without restriction to be simple) J-^ = {x > \ Ax = d} with 
a given degree sequence d is called a fiber for A (or for the underlying graph G). An 
integer array z of the same dimension as x is called a move if Az = 0. A move z is 
written as the difference of its positive part and negative part a.s z = z^ — z^ . Since 
Az = Az^ — Az~ , every move is written as the difference of two graphs in the same 
fiber. A finite set of moves is called a Markov basis for the incidence matrix A if for every 
fiber any two graphs are mutually accessible by the moves in the set [sj. By adding or 
subtracting moves in a Markov basis, we can sample graphs from any fiber in such a way 
that every graph in the fiber is generated with positive probability. Note that Xij in the 
beta model ([T]) is restricted as < Xij < riij. We denote the subset of the fiber J^d with 
this restriction as J-'d,n = {x \ Ax = d,0 < Xij < Uij, G E}. 

To assess the goodness of fit of the beta model we usually utilize a large sample approx- 
imation of the distribution of a test statistics. However, when not large enough, 
it is not appropriate to use the large sample approximation. Especially, as mentioned in 
Section [H graphs are restricted to be simple {uij = 1) in some practical problems. For 
a simple graph, Xij, {i,j} € E, is either zero or one. A Markov basis for the incidence 
matrix A guarantees the connectivity of every fiber J-^ if the restriction that graphs are 
simple is not imposed. Under the restriction, however, a Markov basis does not necessar- 
ily connect the subset J-^.i of the fiber J-^. For example, consider the beta model with 
the underlying graph G in Figure [Hand Uij = 1 for each edge {i,j} G E. It can be shown 
that a set of all 4-cycles in G is a Markov basis for the incidence matrix of G. However 
X and y in Figure [1] are not mutually accessible by 4-cycles under the restriction that 
graphs are simple. 




Figure 1: Example graphs. 
For a given x, supp(a;) = {e \ x^ > 0} denotes the set of observed edges of x. For two 



moves Zi, Z2, the sum Zi + Z2 is called conformal if there is no cancellation of signs in 
zi + Z2, i.e., = supp(zj^) n supp(2;^) = supp(2]~) fl supp(2^). The set of moves which 
can not be written as a conformal sum of two nonzero moves is called the Graver basis. 
The Graver basis is known to be a Markov basis [e.g. 3]. A move is square-free if the 
absolute values of its elements are or 1. By the same augment of Proposition 2.1 of 
Hara and Takemura 0], we can obtain the following proposition. 

Proposition 1. The Graver basis for the underlying graph of the beta model connects 
all elements of every fiber. Furthermore, the set of square-free moves of the Graver basis 
connects all elements of every fiber with the restriction of simple graphs. 

Proof. Let x, y be two elements of the same fiber. The difference y — x is written as a 
conformal sum of primitive moves: 

y - X = zi-\ \- Zr (2) 

where Zi,l < i < r, are elements of the Graver basis. Since there is no cancellation of 
signs on the right hand side, x + Zi + ■ ■ ■ + Zf^ belongs to the same fiber for k = 1, . . . ,r. 
Therefore the Graver basis connects all elements of every fiber. 

Suppose Uij = 1 for every {i,j} E E in the setting of the beta model. It is easy to see 
that each Zj is square-free in ([2]). It means that the set of square- free moves of the Graver 
basis connects all elements of every fiber with the restriction of simple graphs. □ 

Therefore it suffices to have the Graver basis to sample graphs from any fiber with or 
without the restriction that graphs are simple. In the next section we derive the Graver 
basis for the underlying graph of the beta model 

3 Graver basis for an undirected graph 

In this section we will give two characterizations of the Graver basis for an undirected 
graph. Theorem [T] in Section 13.21 is the main result of this paper which gives a necessary 
and sufficient condition for a element of the Graver basis as a sequence of vertices. Propo- 
sition |3l which is used for the proof of Theorem [H gives a characterization of the Graver 
basis through recursive operations on the graph, which is of some independent interests. 

3.1 Preliminaries 

Let G = {V{G), E{G)) be a simple connected graph with V{G) = {1,2, ...,n} and 
E{G) = {61,62, . . . ,6m}. A walk connecting i G V{G) and j e V{G) is a finite sequence 
of edges of the form 

W = {{il, ^2}, {«2, ^3}, • • • , {iq, iq+l}) 

with ii = i,iq+i = j. The length of the walk w is the number of edges q of the walk. 
An even (respectively odd) walk is a walk of even (respectively odd) length. A walk w is 
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closed if i = j. A cycle is a closed walk w = {{ii, ^2}, {^2, "^3}, • • • , {iq, h}) with ii 7^ ii> for 
every 1 </</'< g. 

For a walk u;, let V{w) = {zi, . . . , ig+i} denote the set of vertices appearing in w 
and let E{w) = {{ii, 12}, {^2, is}, ■ ■ ■ , {iq, "^ij+i}} denote the set of edges appearing in w. 
Furthermore let = {V{w), E{w)) be the subgraph of G, whose vertices and edges 
appear in the walk w. 

In order to describe known results on the toric ideal Ig arising from an undirected 
graph G, we give an algebraic definition of Iq- Let K[t] = K[ti, . . . ,tn] be a polynomial 
ring in n variables over K. We will associate each edge = {hj} G E{G) with the 
monomial = titj G K[t\. Let K[s\ = K[si, . . . , Sm] be a polynomial ring in m = |-E'(G')| 
variables over K and let vr be a homomorphism from A'[s] to K[t] defined by vr : 1^. 
Then the toric ideal Iq of the graph G is defined as 

Ig = ker(7r) = {/ G K[s] \ n{f) = 0}. 

A binomial f = u — v E Ig is called primitive if there is no binomial g = u' — v' E Ig-, 
g 7^ 0,/, such that u'\u and v'\v. The Graver basis of Ig is the set of all primitive 
binomials belonging to Ig and we denote it by Q{Ig)- If we write the monomials u,v as 
u = s^, V = s^, then u — v E Ig if and only if a; — y is a move. Furthermore u — v E Ig is 
primitive if and only if supp(a3) nsupp(y) = and x — y can not be written as a conformal 
sum of two nonzero moves. 

For a given even closed walk w = (ej^, Cj^, . . . , Cj^^) we define a binomial fyj E Ig as 

p p 

tw = f^- f~, where = H ^i^^-i' •^'^ = 11 ^J^^- 

k=l k=l 

An even closed walk w' is a proper subwalk of w, if g^, \ and g~, \ f~ hold for the 
binomial g = g^, — 5'.^/ (7^ fw)- Note that even if there is a proper subwalk w' of an even 
closed walk w, w' dose not necessarily go along with w, i.e., the edges of w' may not appear 
as consecutive edges of w. An even closed walk w is called primitive, if its binomial f^) is 
primitive. Then the primitiveness of w is equal to non-existence of a proper subwalk of 
w. 

A characterization of the primitive walks of graph G, which gives a necessary condition 
for a binomial to be primitive, was given by Ohsugi and Hibi 12 . 

Proposition 2 ([lil)- Le.t G be a finite connected graph. If f E Ig is primitive, then we 
have f = fyj where w is one of the following even closed walks: 

(i) w is an even cycle ofG. 

(ii) w = (ci,C2), where c\ and 02 are odd cycles of G having exactly one common vertex. 

(Hi) w = (ci, Wi, C2, W2); where Ci and C2 are odd cycles of G having no common vertex 
and where Wi and W2 are walks of G both of which contain a vertex vi of ci and a 
vertex V2 of C2. 

Every binomial in the first two cases is primitive but a binomial in the third case is 
not necessarily primitive. 
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3.2 Characterization of primitive walks 



In this subsection we give a simple characterization of the primitive walks of a graph 
G as sequences of vertices. Express an even closed walk w as a sequence of vertices: 
{ii,i2, ■ ■ ■ ,i2p,ii), where ii = i2p+i- Let jj^wii) = #{1 < ^ < 2p | = i} denote the 
number of times i is visited in the walk w before it returns to the vertex ii. Consider the 
following condition for the even closed walk w. 

Condition 1. (i) ij^w{i) G {lj2} for every vertex i G V{w). (ii) For every vertex j G 
V{w) with H^wU) = 2 and j = ii = iv , !</</'< 2p, the closed walks w{ = {ii, . . . ,ii') 



no crossing chords in Figured when adding a chord {j,j} to the figure for every vertex 
j G V{w) with = 2. 

Using Condition [H we can characterize the Graver basis for a graph G as follows. 

Theorem 1. A binomial f E Ig is primitive if and only if there exists an even closed 
walk w with fw = f satisfying Condition [H 

Remark 2. It follows from the definition of primitive walks and Theorem U\ that if an 
even closed walk w is primitive, every even closed walk w' with f^' = fw is primitive and 
satisfies ConditionUi 



Remark 3. As mentioned in SectionUl there is another characterization of Graver basis 
in Theorem 3.1 of Reyes et al fT^J . It also gives a necessary and sufficient condition 
for the primitiveness of even closed walks, by using some new graphical concepts such as 
"block" and "sink". Our characterization in Theorem U\ gives a simpler description of 
Graver basis, because it does not need any new graphical concepts. Furthermore it is more 
convenient in the algorithmic viewpoint: When an even closed w is given as a sequence of 
vertices or edges, we can easily determine if w is primitive by checking directly Condition 
U\ without distinguishing any graphical objects. 





w 



Figure 2: Even closed walk w. 
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Before proving Theorem [T], we state another characterization of primitive walks given 
in Proposition |3] below. In order to that, we need some more definitions on graphs. For a 
walk w = (cj^, 6^2, • • • , Cj^), let W = W{w) denote the weighted subgraph (y{w), E{w), p) 
of G where p : E{w) — )■ Z is the weight function defined by p(e) := #{/ | ej^^^^ = e} — #{/ | 
Cj^j = e} for each edge e G E{w). For simplicity, we denote a weight +1 (respectively 
— 1) by + (respectively — ) in our figures. For a vertex i G V{w), we define two kinds of 
degrees of vertex i: 

degG^ii) = #{e e E{w) | i G e}, 
degH^(z) = ^ |p(e)|. 

e(iE{w):i(ie 

AegQ^ii) is the usual degree of i in Gw Note that the same weighted graph W might 
correspond to two different even closed walks w, w', i.e. W{w) = W{w'). Given a weighted 
graph W, we say that w spans W if W = W{w) and {cj^ \ l.odd} fl {ej^ \ /:even} = 0. 
Now we define two operations, contraction and separation, on a weighted graph W. 

• Let e = {i,j} G E{w) be an edge with |p(e)| = 2, whose removal from increases 
the number of connected components of the remaining subgraph. Contraction of 
e is an operation as shown in Figure [31 That is, it first replaces W by W = 
iV{w) \ {j},E',p') where E' consists of all edges of W contained in V{w) \ {j}, 
together with all edges {a, i}, where {«, j} is an edge of W different from e. Then, 
it defines p' by inversion of the signs of weights of edges belonging to the i-side of 
W. 




Figure 3: Contraction. 



• Let i G V{w) be a vertex with degQ^{i) = deg^(i) = 4, such that the removal 
of i increases the number of connected components of the remaining subgraph and 
the positive side as well as the negative side of i fit to one of three cases (a)-(c) 
(respectively to the sign reverse cases) in Figure HI Separation of i is an operation 
as shown in Figure HI That is, it first deletes the vertex i and all edges connected 
to i on W. Then, in the case of (a), it adds a new edge {ki, with weight +1. In 
the case of (b), it redefines p{{ki, /C2}) := +2 and then contracts {ki, /C2}, where we 
assume that the contraction of {/ci,/c2} is possible. In the case of (c), it redefines 
p{{ki, /C2}) := 0. We call this {ki, /C2} an edge with weight 0. The sign reverse cases 
are defined in the same way. 
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(contraction) 




Figure 4: Separation. 



Note that the separation is not defined for any vertex i with degQ^{i) = deg(y(i) = 4, 
if i fits to none of three cases (a)-(c) in Figure |H The vertex i in Figure [5] is such an 
example, because its positive side fits to none of three cases (a)-(c) in Figure |H 

Let insertion and binding be the reverse operations of contraction and separation, 
respectively. With these operations, consider the following condition for an even closed 
walk w = {ej„ej^,...,ejj. 

Condition 2. (i) {ej^ \ l:odd} fl {ej, | l:even} = 0. Every vertex i G V{w) satisfies 
degy^{i) G {2,4}. For every vertex i with degi^{i) = 4, its removal from G„, increases the 
number of connected components of the remaining subgraph, (ii) Let W be a graph obtained 
by recursively applying contraction and separation of all possible edges and vertices in W . 
Then each connected component of W is an even cycle or an edge with weight 0. 




Figure 5: A vertex i whose separation is not defined. 



Proposition 3. For an even closed walk w, the binomial fw is primitive if and only if w 
satisfies Condition\^ 
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We establish some lemmas to prove Proposition [31 Our proof also shows that W in 
Condition [2] does not depend on the order of application of contractions and separations 
excepting the sign inversion of weights of edges of each connected component in W. 

Lemma 1. If w is a primitive walk, w satisfies (i) in Condition\^ 

Proof. Consider a vertex i G V{w). Since w is closed, deg^(i) is even. Furthermore, 
since w is primitive, {ej^ \ /:odd} fl {ej^ \ /:even} = holds which implies that there 
is no cancellation in the calculation of weight on any edge. Then, a half of the weight 
deg^(2)/2 is assigned as positive weights and other half deg^{i)/2 is assigned as negative 
weights to the edges connected to i on W . Therefore deg^(i) G {2, 4, 6, . . .}. Now suppose 
degyi/(z) > 6. Consider that we start from a vertex i along an edge with positive weight 
and go along the walk w or its reverse until returning back to i again for the first time. 
Since w is primitive, we have to come back to i along an edge with positive weight for the 
first time. Let us continue along w or its reverse until returning back to i. By the same 
reasoning, the last edge of this closed walk has a negative weight. This implies that this 
even closed walk becomes a proper subwalk of w, a contradiction to the primitiveness of 
w. Therefore deg^ii) is 2 or 4. 

To prove the remaining part, let i G V{w) be a vertex with (leg^r{i) = 4 and consider 
all closed walks on W, where the edge starting from i and the edge coming back to 
i have positive weights. Let be the set of vertices other than i which appear in 
one of these walks and V~ is defined in the same way. Then U U {i} = V{w) 
holds. First, we show fl V~ = 0. Suppose that there exists a vertex j G V'^ fl 
V~ . Then, as shown in Figure O there are two closed walks ({z, z^}, F^, F^, and 
{{i,i^},T^ , {i2 This implies that we can construct a proper subwalk of w by 

the combination of {i,z^},F^(/c = 1,2), and , {i^ ,i}{l = 1,2), a contradiction to the 
primitiveness of w. Therefore fl V~ = 0. Second, suppose that the removal of the 




Figure 6: Case that there exists a vertex j G fl V . 

vertex i from does not increase the number of connected components of the remaining 
subgraph. Then, there are vertices G V^, v~ G V~ such that {v~^, w~} G E{w), because 
n V~ = holds as shown above. Hence, as shown in Figure [TJ an even closed walk 
{{i,i'l},T'l , {v^^ ,v~},Tf , {ij' ,i}) is a proper subwalk of w for appropriate k,l G {1,2}, 
k ^ I, which contradicts to the primitiveness of w. Therefore the removal of i from G^, 
increases the number of connected components of the remaining subgraph. □ 

In the following four lemmas, we show that contraction, separation, and these inverse 
operations preserve the primitiveness of an even closed walk. The proofs of lemmas are 
postponed to Appendix. 
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V V 




Figure 7: Case that there exists an edge {v^,v }. 



Lemma 2. Let an even closed walk w be primitive and W be the weighted graph which 
is obtained by a contraction for an edge with its weight ±2 on W. Then any even closed 
walk w spanning W is primitive. 

Lemma 3. Let an even closed walk w be primitive and Wi,W2 be the weighted graphs 
obtained by the separation of a vertex i. Then any even closed walks Wi{l = 1,2) spanning 
Wi{l = 1,2) are primitive or of length two with f^^ =0. 

Lemma 4. Let w be a primitive walk and let W be the weighted graph obtained by the 
insertion to i with deg-^{i) = A on W . Then any even closed walk w spanning W is 
primitive. 

Lemma 5. Let each Wi (/ = 1,2) be a primitive walk or a closed walk with length two, 
and W be the weighted subgraph obtained by binding ofWi and W2. Then any even closed 
walk w spanning W is primitive. 

We now give proofs of Proposition [3] and Theorem [H 

Proof of Proposition 0. Let w he a primitive walk. From Lemma [1] w satisfies (i) in 
Condition [2] and every edge e with |p(e)| = 2 can be contracted. Furthermore, it is easy 
to see that every vertex i with degy^/{i) = 4 can be separated after recursively applying 
contractions of all possible edges. Therefore deg-^^{i) = 2 holds for every vertex i on W. 
From Lemmas [2] and [3], each even closed walk corresponding to the connected component 
of W is primitive or of length two. Then, every connected component of W is an even 
cycle or an edge with weight 0, because from Proposition [2] every primitive walk includes 
a vertex i with degy^,{i) = 4 if it is not an even cycle. Therefore, a primitive walk w 
satisfies Condition [2J Conversely, suppose an even closed walk w satisfies Condition [21 
From Proposition [21 and Lemmas [H and [5l w is primitive. □ 

Proof of TheoremUl Let w be a primitive walk. From Lemma [H #^(i) G {1,2} holds 
for each vertex i G V{w) and V{wl) fl V{w2) = {j} holds for each vertex j G V{w) 
with #w(j) = 2. By the primitiveness of w, the closed walks wl = (j, ■ ■ ■ ,j) and = 
(j, . . . ,ii, . . . , j) along w are odd closed walks. Therefore w satisfies Condition [H 

Conversely, let w be an even closed walk with Condition [H From Proposition [3l it 
suffices to show that w satisfies Condition [2l The condition (i) in Condition [2l follows from 
Condition [H Then, it is enough to confirm that w satisfies the condition (ii) in Condition 

El 
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First, we claim that every edge e G E{w) with |p(e)| = 2 can be contracted and 
every vertex j with = 2 and degG.„(j) = 4, i.e. deg^^O') = deg^ij) = 4, can be 

separated. The case of contraction is obvious from Condition [1] We confirm the case of 
separation. Consider the vertex j in Figure |H1 If an edge {ki, ^2} dose not exist or exists 




Figure 8: A vertex j with degQ^^ {j) = deg^(j) = 4. 



with weight +1, it belongs to the case (a) or (b) in Figure HI respectively. Let us consider 
the case that there exists an edge {/ci, ^2} with weight —1 and suppose that the vertex ki 
connects to more than three edges as shown in Figure M Then, j, ki and k2 appear in w 




Figure 9: A vertex j which does not exist in w with Condition [H 

like (j, /ci, . . . , ki, k2,j) or (j, /ci, . . . , fci, /c2, • • • , k2,j), because V{w{)nV{wi) = {j} holds. 
This implies that {ki, . . . , ki) is even as shown in Figure [TOl which contradicts Condition 
[U Hence the case with {ki, ^2} with weight —1 belongs to (c) in Figure ID Therefore the 
claim is confirmed. 









j or JC 











Figure 10: Case that there exists a vertex j in Figure |9l 



Second, we verify that contraction and separation on W preserve Condition [T] Con- 
sider the case of contraction of an edge {i,j} G E{W). From Condition [H such i,j 
appear in w as w = (zi, . . . , z/^, z, j, z/j, . . . , z/3, j, z, z^, . . . , zi). The contraction of 
is equivalent to replacing w by (zi, . . . . . . , ii^yi, ii^, ■ ■ ■ , ii). This change causes 

the decrease of two edges from w, and preserves Condition [T] The case of separation is 
checked in the same way. 

Finally, consider the weighted graph W obtained by all possible contractions and 
separations on W. From the claims above, every connected component of W satisfies 
Condition [Hand has no vertex j with = 2, i.e. an even cycle or an edge with weight 

0. Therefore w satisfies Condition |2J □ 
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4 Algorithm for generating elements of Graver basis 



In this section we present an algorithm for generating elements randomly from the Graver 
basis for a simple undirected graph. As shown in Proposition [H for testing the beta model 
of random graphs with Uij = 1, we only need square-free elements of the Graver basis. 
Therefore the restriction to square-free elements of our algorithm will be discussed in 
Remark |H Theorem [1] guarantees the correctness of our algorithm. 

We need some tools in order to construct an algorithm. Let T be a weighted tree 
(y(T), E{T), fi) where /i : V(T) — )■ Z>2 = {2,3,...} is a weight function. For this 
weighted tree T, let us consider the following condition. 

Condition 3. For each vertex vt G V(T), deg(fr) < fJ'ivr) and deg{vT) = f^ivr) mod 2. 

With these tools, let us consider generating an element of the Graver basis for a simple 
undirected graph G = {V{G), E{G)). For simplicity, first consider the case that that G is 
complete. We call an edge e with |p(e) | = 2 a cycle in Gy^ for an even closed walk w in this 
section. We will discuss later the case that G is not complete. Let T = {V(T), E(T), fi) 
be a weighted tree satisfying Condition [3] and the following equation: 



Then, we can construct a primitive walk in G using T as follows. First, we assign the set 
of vertices V^^ C V{G) with \Vyj.\ = fi^vx) for each vertex vt G V(T) under the equation 



and every vertex v G V{G) is assigned at most twice. Equation ([3]) guarantees that this 
assignment is possible. Second, we make cycles in G by arbitrarily ordering the vertices 
Vy^. Then we make a subgraph of G by taking the union of these cycles. Finally, we 
obtain a closed walk by choosing a root vertex from this subgraph and going around it. 
It is easy to see that this closed walk is primitive by Theorem [H 

Conversely we can construct a weighted tree with Condition [3] and ([3]) from each 
primitive walk. Let w be a primitive walk. First, the vertex set V(T) is constructed by 
creating a vertex Vc of T for each cycle c in G^. Second, the edge set E{T) is obtained 
by adding edge {v^, Vc'} to E(T) for each pair of cycles c, c' in Gy, with V{c) fl V{c') ^ 0. 
Then, we assign weight /i(wc) := |^(c)| to each vertex Vc G ^(T). 

Therefore, once we have a weighted tree T with Condition [3] and ([3]), we can construct 
an element of the Graver basis for G. Such a tree T is constructed by the following 
algorithm. 

Algorithm 1 (Algorithm for constructing an weighted tree). 
Input : A complete graph G = {V{G), E{G)) . 

Output : A weighted tree T = {V{T), E{T), ^) with Condition\^ and 




(3) 
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1. LetV{T),E{T) be empty sets and n := \V{G)\. 

2. Add a root vertex r to V{T). 

3. Assign yu(r) a weight from {2, 3, ... , n} randomly. 
4- Grow T by the following loop. 

(a) For each vertex vt € V(T) which is deepest from r, add edges {vt,v^} to 
E{T) and the endpoints (i = 0,1, . . . , 1^^) to V(T), where the number ly.^ 
is randomly decided under the following two conditions: 

+ 1 < Kvt). 
• + 1 = ij,{vt) mod 2. 

(b) For each new vertex v]^, assign ^{v\) a weight from {2, 3, . . . , ra — a} randomly, 
where a := Y.vt<^v{t) - \E{T)\. 

(c) Recompute a and if a > n, delete all new vertices and edges in the above (a) 
and break the loop. 

(d) If the total number of new edges is equal to 0, break the loop. 

(e) Return to (a). 

5. If \V{T) \ = 1 and fi{r) is odd, change fi{r) to /u(r) — 1 or fi(r) + 1. 

6. If \V(T)\ > 1 and T has a leaf with even weight, subtract or add 1 to the weight. 

7. Output T. 

Algorithm [T] provides a simple algorithm for generating an element of Graver basis as 
follows. 

Algorithm 2 (Algorithm for generating an element of Graver basis). 
Input : A complete graph G = {V{G), E{G)) . 
Output : A primitive walk w. 

1. Construct a weighted tree T with Condition\^ and ^ by AlgorithmUi 

2. Construct a primitive walk by assigning vertices of G and ordering them randomly. 

3. Output w. 

Since there is no restarts in Algorithm [21 it has a fixed worst case running time for 
a complete graph G. In each step, the algorithm performs 0(|l^(G)|) operations. Then 
it generates one element of the Graver basis for G in 0(|\^(G)|) time. A demonstration 
for the case of a complete graph G with = 25 is shown in Figures [TT] and [T2l The 

output of this demonstration is a primitive walk w with |V^(G'„,)| = 21 in Figure [T2l 

Remark 4. For the case that an input graph G is not complete, the elements of the Graver 
basis for G can be generated by throwing away elements with supports not contained in 
G (Proposition 4-13 of Sturmfels \2W). In fact this is the advantage of considering the 
Graver basis. The restriction for generation of square-free elements of the Graver basis 
can be realized by a slight modification in AlgorithmUi In fo.ct, it suffices to change merely 
{2, 3, 4, . . .} to {3, 4, . . .} in Step 3 and in (b) of Step 4 in AlgorithmUi 
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Over! 




Stop : : Stop 



(S': (5) (S} (5} Output 



Figure 11: Demonstration of Algorithm [T] 




Figure 12: Demonstration of Algorithm El 



Remark 5. The output of Algorithmic is not uniformly distributed over all elements of 
Graver basis. The distribution depends on how to implement the randomness in Step 3 
and in (b) of Step 4 in Algorithmic 

Algorithm |2] allows us to uniformly sample graphs with the common degree sequence 
via Metropolis-Hastings algorithm with the Graver basis, with or without the restriction 
that graphs are simple. It is done by constructing a connected Markov chain of graphs with 
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the common degree sequence. In each iteration, a primitive walk is randomly generated 
by Algorithm [2l If the primitive walk is applicable, a new sample graph with the same 
degree sequence is obtained by adding the primitive walk, otherwise the primitive walk 
is rejected. Note that Metropolis-Hastings algorithm does not require the uniformity of 
the distribution of generated primitive walks. As long as there is a positive probability of 
generating every element of the Graver basis, the Metropolis-Hastings algorithm realizes 
uniform sampling of graphs with the common degree sequence. 



5 Numerical experiments 

In this section we present numerical experiments with elements of the Graver basis com- 
puted by Algorithm [2] in Section HI The implementation of Metropolis-Hastings algo- 
rithm with Algorithm |2] is done by Java 1.6.0 on Windows OS with Intel(R) Core(TM) 
i7-2829QM CPU@2.30GHz. 



5.1 A simulation with a small graph 

We run a Markov chain over the fiber containing a small graph Hq in Figure [131 The 
underlying graph G = is assumed to be complete with eight vertices. By the Markov 
chain we sampled 510,000 graphs in the fiber, including 10,000 burn-in steps. The number 
of types of obtained graphs in our chain is 591. By enumeration we checked that 591 is 
actually the number of the elements of the fiber of Hq. The histogram of this experiment is 
shown in Figure [T4l The horizontal axis expresses the frequency of each type of graph and 
the vertical axis expresses the number of types. The mean of the number of appearances 
of each type is 829 and the standard deviation is 179. This experiment shows that the 
algorithm samples each element of the fiber almost uniformly. 




I I I I I I 

400 800 1200 

Figure 13: Small graph Hq. 

Frequency 

Figure 14: Histogram from sampling. 



5.2 The beta model for the food web data 

We apply Algorithm |2] for testing of the real data, the observed food web of 36 types of 
organisms in the Chesapeake Bay during the summer. This data is available online at 
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Blitzstein and Diaconis [l| analyzed essentially the same data set. 




Figure 15: Food web for the Chesapeake Bay during the summer. 



The graph H of the data is shown in Figure [151 The vertices represent the types of 
organisms like blue crab, bacteria etc., and the edges represent the relationship of one 
preying upon the other. The degree sequence of H is 

(9, 10, 6, 2, 3, 3, 9, 11, 6, 4, 6, 7, 5, 7, 8, 4, 3, 8, 
7, 2, 3, 11, 8, 2, 4, 5, 7, 4, 4, 4, 3, 5, 5, 2, 14, 29). 

Although there is a self loop at the vertex 19 in the observation, we ignored it for simplicity. 

We set the beta model ^ in Section [2] with riij = 1 for each edge {i,j} as the null 
hypothesis. Then the probability of H is described as 

P{H) oc ^^^f : ^. (4) 

Parameter (i G V) is interpreted as the value of organism represented by the vertex i 
as a food to other organisms. Then the beta model (jl]) implies that a vertex i with large 
ai is likely to be connected to many edges. Let P G (jl]) mean that P can be expressed 
by (H]) for a set of parameters {ai}i^v Consider now the statistical hypothesis testing 
problem 

Ho:Pe(^ versus ifi : P ^ Q. 

Starting from the graph in Figure [T5| we construct a Markov chain of 10,100,000 graphs 
including 100,000 burn-in steps and compute the chi-square statistic of each graph as a 
test statistic. The running time of the calculation is 5 minutes and 4.8 seconds. Using 
the maximum likelihood estimator, the chi-square value of observed graph H is 477 and 
the histogram of the estimated distribution of the chi-square values is shown in Figure 
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Chi-square statistic 

Figure 16: Histogram of chi-square statistic. 



[T6l The approximate p-value is 0.286. This value is not so small and there is no evidence 
against the beta model 

Next we consider some other characteristics of the observed graph H and graphs 
obtained by the above Markov chain. We compute their clustering coefficient defined 
by Wattz and Strogatz [23| and also count the number of triangles (3-cycles). For the 
observed graph H, the values of clustering coefficient and the number of triangles are 
0.447 and 101, respectively. For the sampled graphs, the histograms are obtained as in 
Figure [T7] and [TS] and their mean values are 0.436 and 92.4, respectively. The differences 
between the actual values and the means of sampled graphs are not large. It suggests 
that these statistics agree with the beta model (jlj). 



0.30 0.40 0.50 
Clustering coefficient 



80 90 100 110 



Number of triangles 



Figure 17: Histogram of clustering coeffi- Figure 18: Histogram of number of triangles, 
cient. 

As mentioned in Section [T] there are computer algebra systems such as 4ti2 (4ti2 team 
2l| ) to compute the Graver basis. However the whole Graver basis is huge and difficult 
to compute even for a moderate-sized graph like the real data above. Algorithm [21 our 
adaptive algorithm, enables us to perform the Markov chain Monte Carlo method for such 
a mo derate- sized graph. 
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6 Concluding remarks 



In this paper we obtained a simple characterization of the Graver basis for toric ideals 
arising from undirected graphs. This Graver basis allows us to perform the conditional 
test of the beta model for arbitrary underlying graph. Our characterization allows us to 
construct an algorithm for sampling elements of the Graver basis, which is sufficient for 
performing the conditional test. 

By numerical experiments we confirmed that our procedure works well in practice. 
We should mention that the sequential importance sampling method of Blitzstein and 
Diaconis (H may work faster for the case of complete underlying graph. 

If we allow multiple edges, then we do not need the Graver basis. A minimal Markov 
basis, which is often much smaller than the Graver basis, is sufficient for connectivity of 
Markov chains. Properties of Markov basis for the pi-model have been given in Petrovic 



et al [16|. It is of interest to study properties of minimal Markov bases for undirected 



graphs, including the case of allowing self loops. 
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A Proofs of Lemmas in Section 



K2 



A.l Proof of Lemma [2] 

The contraction of the edge with its weight ±2 on W is possible from Lemma [TJ We 
denote this edge by e = as shown in Figure Suppose w is not primitive. Then 




w w 



Figure 19: Contraction of an edge e. 



there exists an proper subwalk w' of w. If z ^ V{w'), w' is also a proper subwalk of a 
contradiction to the primitiveness of w. Then i G V{'w'). However, a proper subwalk of 
w is constructed by embedding e into W' . Therefore, w is primitive. □ 

A. 2 Proof of Lemma [3] 

We consider the case that both positive and negative sides of i correspond to (a) in Figure 
m and relevant edges are labeled as shown in Figure [201 Suppose wi is neither primitive 
nor of length two. Then there exists a proper subwalk w'l of wi on Wi. If e"*" ^ E{w'i), w'l 




Figure 20: Separation of a vertex i. 



is also a proper subwalk of ty, a contradiction to the primitiveness of w. Then G E{wi). 
Now w'l is expressed as follows: 

Wi = (cj-^, 6^2, . . . , Cij., e , eij._|_-^, . . . , ej^). 

Then an even closed walk on W 

is a proper subwalk of w. This contradicts the primitiveness of w. Therefore wi is 
primitive or of length two. The cases of (b) and (c) in Figure H] are shown in the same 
way. Note that it is easy to confirm the possibility of contraction after the step 1 in the 
case (b) from Lemma [1] and then the primitiveness is guaranteed by Lemma [2l By the 
same argument, the case of W2 is confirmed. □ 
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A. 3 Proof of Lemma [4] 

Let e be the new edge appearing through the insertion to i as shown in Figure [2T1 Suppose 




W Wi W2 

Figure 21: Insertion to a vertex i. 



w is not primitive. Then there exists a proper subwalk w' of w. li e ^ E{w'), w' is 
contained in Wi or W2- Then w' or its reverse becomes a proper subwalk of w. This 
contradicts the primitiveness of w. Hence e G E{w'). Then we can construct a proper 
subwalk of w by removing e from iS' and reversing the weights of edges belonging to 
E{wi), a contradiction to the primitiveness of w. Therefore, w is primitive. □ 

A. 4 Proof of Lemma [5] 

Let i be the new vertex appearing through the binding. We consider the case that both 
positive and negative sides of i correspond to (a) in Figure H] and relevant edges are 
labeled as shown in Figure |22l Other cases are shown in the same way. Suppose w is 




Figure 22: Binding of Wi and W2- 



not primitive. Then there exists a proper subwalk w' of w. Here we choose a primitive 
walk as w'. If i ^ V{w'), w' is also a proper subwalk of wi or W2- Then i G V{w'). This 
implies that all four edges connected to i appear in w'. Let us consider the separation of 
i to W. Then the resulting two weighted graphs W[,W2 are primitive from Lemma |3l 
Furthermore at least one of {i = 1, 2) is a proper subwalk of Wi, a contradiction to the 
primitiveness of Wi. Therefore w is primitive. □ 
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